


Institutional Archive of the Naval Postgraduate School 





Calhoun: The NPS Institutional Archive 
DSpace Repository 


Theses and Dissertations 1. Thesis and Dissertation Collection, all items 


1975-09 


Analysis of the variable behavior manifested 
in all Navy/Marine major aircraft accident rates 


Maxwell, John Scott; Stucki, Laurence Valdimir 


Monterey, California. Naval Postgraduate School 
http://ndl.handle.net/10945/20748 


This publication is a work of the U.S. Government as defined in Title 17, United 
States Code, Section 101. Copyright protection is not available for this work in the 
United States. 


Downloaded from NPS Archive: Calhoun 


Calhoun is the Naval Postgraduate School's public access digital repository for 
(8 DUDLEY research materials and institutional publications created by the NPS community. 
«ist sae Calhoun is named for Professor of Mathematics Guy K. Calhoun, NPS'‘s first 


INN KNOX appointed — and published -- scholarly author. 

| LIBRARY Dudley Knox Library / Naval Postgraduate School 

411 Dyer Road / 1 University Circle 
Monterey, California USA 93943 





http://www.nps.edu/library 








NAVAL POSTGRADUATE SCHOOL 


Monterey, California 









, ANALYSIS OF THE VARIABLE BEHAVIOR 
MANIFESTED IN ALL NAVY/MARINE MAJOR AIRCRAFT 
ACCIDENT RATES 





by 






John Scott Maxwell 


and 







Laurence Valdimir Stucki 






Thesis Advisor: Gary K. Poock 





Approved for public release; distribution unlimited. 


UNCLASSIFIED 


SECURITY CLASSIFICATION OF THIS PAGE (When Dete Frtered) 





READ INSTRUCTIONS 
BEFORE COMPLETING FORM 


S$. TYPE OF REPORT & PERIOO COVERED 
Master's Thesis 
September 1975 


6. PERFORMING ORG. REPORT NUMBER 





REPORT DOCUMENTATION PAGE 


1. REPORT NUMBER 2. GOVT ACCESSION NO. 


4. TITLE (and Subtitle) 
Analysis of the Variable Behavior Mani- 

fested in All Navy/Marine Major Aircraft 
Accident Rates 



























8. CONTRACT OR GRANT NUMBER(e) 





7. AUTHOR(e) 


John Scott Maxwell 
Laurence Valdimir Stucki 












10. PROGRAM ELEMENT. PROJECT, TASK 
AREA & WORK UNIT NUMBERS 


12. REPORT OATE 

September 1975 | 
13, 

56 





9. PERFORMING ORGANIZATION NAME ANO ADDRESS 


Naval Postgraduate School 
Monterey, California 93940 











CONTROLLING OFFICE NAME AND ADORESS 





Hi. 


Naval Postgraduate School 
Monterey, California 93940 





NUMBER OF PAGES 







SECURITY CLASS. (of thie report) 


| Unclassified 





@ MONITORING AGENCY NAME & ADORESS(If different irom Controlling Ollice) | 15. 


Naval Postgraduate School 
Monterey, California 93940 














1Se. OECLASSIFICATION/ DOWNGRADING 
SCHEOULE 


16. DISTRIBUTION STATEMENT (of thie Report) 


Approved for public release; distribution unlimited. 


17. OISTRIBUTION STATEMENT (of the abetract entered {tn Block 20, (f{ different from Report) 


18. SUPPLEMENTARY NOTES 


19. KEY WORDS (Continue on reveree eide if neceeeary and identify by block number) 


Accident Aircraft Prloe Error 





20. ABSTRACT (Continue on reveree side if neceeeary and identify by block number) 


Pol Nayy/ Marine monthiy aireratt accident rates exter a 
behavior of marked variability which cannot be attributed solely 


to weather or other natural phenomena. Variable measures con- 
Strued as time dependent were obtained for all major accidents 
between July 1968 and June 1974. Stepwise linear multiple re- 
gression studies relating the variables to accident rate showed 
pelLOt age, Gaylignt pilot flight hours for the 90 days preceding 





DD ie 1473. — EDITION OF 1! NOV 6815 OBSOLETE UNCLASSIFIED 
S/N 0102°014* 6601 | ig i ee 
(Page 1) SECURITY CLASSIFICATION OF THIS PAGE (When Dete Entered) 


UNCLASSIFIED 


SECURITY CLASSIFICATION OF THIS PAGE(MHen Detea Entered. 


the accident, the number of night carrier landings in the previ- 
ous 30 days, and the number of daylight carrier landings in the 
previous 30 days explained 46.65% of noted accident rate vari- 
ance. The results corroborate previously held theories that 
pilot error is the single largest causal factor in aircraft 
accidents. 


Woes, te UNCLASSIFIED 
S/N 0102-014-6601 SECURITY CLASSIFICATION OF THIS PAGE(When Data Entered) 


ow) 





Analysis of the Variable Behavior Manifested 


in All Navy/Marine Major Aircraft Accident Rates 


by 


wonn Seoct Maxwell 
Lieutenant, United States Navy 
B.A., Purdue University, 1966 
and 
Laurence Valdimir Stucki 


Lieutenant Commander, United States Navy 
B.A., Harvard College, 1963 


Submitted in partial fulfillment of the 
requirements for the degree of 


MASTER OF SCIENCE IN OPERATIONS RESEARCH 


from the 


| 
: 








a 





ABSTRACT 


All Navy/Marine monthly aircraft accident rates exhibit 
a behavior of marked variability which cannot be attributed 
solely to weather or other natural phenomena. Variable meas- 
ures construed as time dependent were obtained for all major 
accidents between July 1968 and June 1974. Stepwise linear 
multiple regression studies relating the variables to acci- 
dent rate showed pilot age, daylight pilot flight hours for 
the 90 days preceding the accident, the number of night car- 
rier landings in the previous 30 days, and the number o£ day- 
light carrier landings in the previous 30 days explained 
46.65% of noted accident rate variance. The results corrob- 
orate previously held theories that pilot error is the single 


largest causal factor in aircraft accidents. 
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Dollar costs and 
each year have often 
determining a viable 
accidents. Research 


marily with attempts 


I. INTRODUCTION 


the number of aviation personnel killed 

been used to emphasize the importance for 
method to reduce Navy and Marine aircraft 
efforts conducted to date have dealt pri- 


to identify and understand the factors 


which were instrumental in causing accidents. Once identi- 


fied, these pertinent factors or variables were to become the 


crux for developing predictive models for accident occurrence. 


Aircraft accidents have been broadly categorized in terms 


of causal factors which were determined from extenSive post- 


accident analysis by 


accident investigation teams. Aircraft 


Or ground support equipment failures, weather, pilot or other 


flight personnel error, maintenance induced equipment mal- 


functions, and shortcomings in design have all been listed 


aS primary cause for 


occurrence of major aircraft accidents. 


An accident is designated as a major accident if: 1) loss 


of life is involved; 


2) complete loss of an aircraft is in- 


volved; or 3) substantial damage occurs to any aircraft in- 


volved where substantial damage is defined in Appendix A of 


OPNAVINST 3750.6 (Series). Of these, the most common cause 


Sited has been pilot 


CrrOnies BrictSonmeametealwess Goes. Studied 


aircraft carrier landing accidents spanning the years 1965 


through 1969. Although the study was limited to attack and 


fimahter arrcraft, the proportion of accidents attributed to 





Polot G€rror is indicative of the inordinately high aircraft 
and personnel loss rates incurred over all aircraft types. 
Approximately seventy-eight percent of accidents studied were 
linked to pilot error as being the primary causal factor. 
Over eight percent of the accidents were attributed to errors 
committed by other supporting personnel, leaving only thir- 
teen percent to be distributed among weather, aircraft fail- 
ure, equipment failure, and other causes. Brictson noted 
that the preponderance of accidents were of two types, hard 
landings and undershooting the landing area. Hard landings 
were more prevalent during daylight hours and undershots dur- 
ing hours of darkness. As expected, small carriers accounted 
for approximately seventy percent of the total accidents even 
though flight activity was less than on large carriers. 
Studies conducted for the Royal Air Force by Goorney 
(1965) were an attempt to subcategorize pilot error into its 
component parts, more molecular in scope than had previously 
been done. He concluded that lack of current flying experi- 
ence, fatigue, complacency, personal worries, and emotional 
stress directly contributed to pilot error and, if monitored, 
Cemmra be used to predict the likelihood of pilot error re- 
lated accidents. He attributed fatigue to pilots having had 
excesSive ground duties prior to flying and aircrew compla- 
cency to lengthy flights in relatively simple aircraft. 
Questionnaires distributed among the ninety accident involved 


pilots making up his sample indicated that emotional stress 





and personal worries were caused by marital, dating, housing, 
financial, and work oriented problems. 

Current flying experience or pilot proficiency has been 
singled out by some analysts as the best predictive measures 
of pilot error accidents. Keller (1961) hypothesized that 
the amount of flight time logged by a pilot during a given 
period was positively correlated with optimal proficiency. 

He intimated that were a pilot to fly the proper amount, he 
would attain a safe, proficient ability as a pilot. Unfortu- 
nately, the hypothesis was not backed with specific guide- 
lines of how to ascertain the necessary hours of flight time 
which would neither favor fatigue nor insufficiency over 
optimal proficiency. 

Collicot, et al. (1972) categorized accident causal fac- 
tors into pilot error, material failure, maintenance error, 
and miscellaneous other causes. In comparing Navy-Marine F-4 
accident rates with Air Force accident rates, the authors 
attributed maintenance error disparities to the fact that 
Air Force F-4 aircraft realized approximately one-tenth of 
the Navy-Marine cannibalization rate to meet sufficient 
Operational aircraft requirements. Naval-Marine officer job 
rotational policies were singled out as adversely affecting 
pilot proficiency. The policies result in lower in-type 
flight hours for Naval-Marine aviators than their Air Force 
counterparts enjoy. The authors also compare accident rates 
of single-seat aircraft with those of dual piloted aircraft. 


They note that when operations required of the aircraft are 





equalized as best possible, the dual piloted aircraft have 
noticeably fewer accidents per ten thousand flight hours than 
do the single-seated versions. Operational F-4 aircraft were 
used to make the comparisons, comparisons limited to instru- 
ment, take-off, and landing phases of flight. Throughout 

the analysis, the authors continually refer to the importance 
of pilot proficiency but now, unlike prior authors, have al- 
luded to the possibility of pilot mental overload being 
critical in the determination of a prime factor in pilot 
error. 

A shift occurred in the emphasis from pilot error induced 
accidents being caused by lack of pilot proficiency to impli- 
cations that the pilot is all too often proficient but may 
be in a state of temporary mental overload. Data extracted 
from National Transportation Safety Board records led 
Kowalsky, et al. (1974) to posit causal factors for the high 
pilot error rate. Pilot error had heretofore inferred that 
practice of pilot procedure was the only method of improving 
pilot proficiency and thus reducing error. Kowalsky and his 
co-researchers applied cluster analysis and pattern recogni- 
tion techniques to their air carrier accident data and found 
that for non-training, non-midair accidents, the single most 
important human causal factor was that pilots were often 
temporarily overloaded and incorrectly evaluated information 
inputted to them during the overload period. Training flight 


accidents were attributed to instructor pilots delaying 


10 





recovery or corrective actions until conditions became too 
far out of tolerance to properly affect recovery. Midair 
collisions were best explained by pilots attempting to meet 
time schedules or destinations where such outcomes were im- 
probable at best. 

Burin (1974) took a slightly different tack in treating 
the problem of aircraft accidents. Rather than dwell on 
accident investigator assigned causal factors, he constructed 
a measure of risk for the twelve individual aircraft models 
which contributed the highest towards total flight hours and 
accident occurrence in the Navy-Marine inventory. Four areas 
of risk were defined to include take-off, in-flight, transi- 
tion, and landing evolutions. Accidents were assumed to occur 
in accordance with a Poisson process. Using data which cov- 
ered Fiscal Years 1969 through 1973 obtained from the Naval 
Safety Center, he constructed a risk index for each aircraft 
considered. The risk index consisted of the data derived 
risk in each of the four phases of flight multiplied by the 
associated percentage of an average flight spent in each 
phase. Although the model does in fact conform to the actual 
individual accident rates observed, it does little to isolate 
one or more specific factors enabling corrective action to be 
taken to reduce the very accident rate it models. 

A great deal of effort has been expended since the advent 
of the Naval Safety Center in maintaining extensive data 


banks of accident related information. Statistical analysis 


Jen 





of available data, much of which can be construed as measures 
of pilot proficiency, should enable predictive mathematical 
models to be constructed. Such an undertaking was attempted 
by Myers (1974). He hypothesized that measures of pilot 
experience and proficiency, then available in collection 
agency data banks, would suffice to form an adequate founda- 
tion for accident rate analysis. He selected ten variables 
SieeactOrs from the Individual Flight Activity Reporting Sys- 
tem (IFARS) data bank to which he later applied statistical 
techniques of principle component analysis and cluster analy- 
sis. The analysis was limited to two groups of fifty pilots 
each. One group was composed of pilots having been involved 
iaearrecraft accidents, the other of pilots free of any acci- 
dent participation. Results were not as pronounced as was 
desired, however. Small sample size is suspect in having 
Surpressed accident predictive results. 

The authors of this writing agree with the basic premise 
promoted by Myers and others. Sufficient data should be cur- 
rently available, from which predictive capability is ex- 
tractable. The variable nature of the monthly accident rate 
suggests underlying factors causal and thus definable in 
their role of accident perpetration. What common factors 
act tO cause accident rate fluctuations? If statistical 
analysis can isolate variable measures associated with pilot 
Beoticiency, aircraft maintenance, flight mission categories, 


or fiscal management which vary directly or inversely with 


de 





accident rate, then predictive and thus preventative know- 
ledge can assist in suppressing dollar and human life costs 


resulting from aircraft accidents. 
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II. NATURE OF THE PROBLEM 


Past monthly major accident rates have been computed for 
all Navy/Marine aircraft by the Naval Safety Center (NSC), 
Norfolk, Virginia. The rate is defined as the total number 
of accidents in a given month multiplied by a constant factor 
of ten thousand and then divided by the total monthly hours 
flown. Major accidents, by definition, are characterized by 
extensive aircraft damage, measured in necessary man-hours to 
Sercet repair if repair is possible, or Hees of life. 

Monthly accident rates exhibit a marked variability when 
each calendar month is compared to other months. Some month- 
ly rates, however, seem to be consistently high. January and 
July rates are higher than the yearly average for four of the 
Six sequential fiscal years beginning with 1968. This phe- 
nomena has also been noted in U.S. Air Force accident rates 
as noted by Zeller and Marsh (1973). Such seasonal trends 
and monthly rate variability cannot be attributed solely to 
weather. The purpose of this paper is to explore accident 
rate dependence on time related variable measures in hopes 
that one or more of these measures can be identified for 


later use in accident rate reduction. 
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TIT. GANAEYTICAL, PROCEDURES 


A. DATA SOURCE 

OPNAVINST 3750.6 (Series) delineates the requirements and 
procedures for reporting each aircraft incident or accident 
involving Naval and Marine aircraft. The type and number of 
different reports required for each accident varies, the ex- 
treme case to include seven separate reports with accompany- 
ing photographs of the crash site, terrain and flight path 
sketches, and detailed statements from knowledgeable witnesses 
or experts. The reports are to be forwarded to NSC for in- 
clusion into their master data bank. Accident data currently 
available from NSC spans the period from the early 1960's to 
the present. Approximately eighty separate variable measures 


are available for each accident occurrence. 


B. DATA SELECTION 

Mhe initial step in the conduct of the current accident 
rate analysis was to select appropriate data points or vari- 
able measures. A data point for an accident was considered 
to be any suitable variable measure associated with the acci- 
dent. Suitable data points could have taken the form of 
accident occurrence date, pilot age, or aircraft model. A 
particular data set consisted of data points for a specific 
accident. 

A sufficient number of data sets had to be incorporated 


into the analysis to facilitate viable statistical results. 


ibs: 





However, the span of time defined by the data sets to be 
analyzed had to be chosen with care. Were a time span chosen 
during which numerous types of aircraft were removed from the 
operational inventories, undesirable effects could have re- 
sulted. Due to the rapid advances which had occurred in air- 
Sateoneeehnology, data from accidents prior to Fiscal Year 
1969 was not deemed suitable for analysis inclusion with the 
present inventory of aircraft. In the opinion of the authors, 
data for accidents which occurred after Fiscal Year 1974 was 
suspected of containing gaps in information due to continu- 
ing investigations by aircraft custodians. The final deci- 
sion was, therefore, to include all major accidents which 
occurred during the Fiscal Years 1969 to 1974, incluSive. 
Two-thousand-one-hundred-ten accidents or data sets available 
within the six-year period selected were all considered suit- 
able for analysis. 

The NSC data bank provided a ready source of numerous 
data points for each accident or data set. Selection of ap- 
propriate data points required that each point be time depend- 
ent. Subjective decisions of time dependency were carefully 
made for each data point considered for analysis. Data point 
time dependency and subsequent selection was based on the 
variable descriptions contained in the Manual of Code Classi- 
mucation for Navy Aircraft Accident, Incident and Ground 
Accident Reporting (Code Manual) promulgated by NSC. The 


number of different data points selected were not governed 
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solely by the current study but also by the requirement for 
follow-on studies to be conducted by this institution. Con- 
tinuity of successive studies must necessarily be driven by 
the consistency of logic used in formulation of a data selec- 
een Criterion. 

Fiscal funding policies at the squadron level was held 
suspect in contributing towards the above average observed 
accident rates for January and July. Financial data of this 
sort, however, was not available from the NSC data bank. An 
extensive search for suitable data proved to be unsuccessful 
and time constraints precluded further endeavors into this 
field. 

A historical breakdown of individual squadron flight hours 
per month and flight hours by type aircraft were not avail- 
able from NSC. The information was deemed necessary for in- 
depth analysis because of its necessity in calculating acci- 
dent rates by aircraft type. Because of the varied flight 
envelopes in which different aircraft operate, some types 
were assumed to contribute more heavily towards the overall 
accident rate variability than were others. To properly ex- 
plain the variability, then, required an analysis at the air- 
craft type level. NSC found a source of the needed aircraft 
type and squadron flight data in digital tape form from the 
Naval Supply Corps maintained Maintenance Data Collection 


System (MDCS) records kept at Mechanicsburg, Pennsylvania. 


Jy 





peeeoral Of thirty ditterentc data points for each of the 
two-thousand-one-hundred-ten aircraft accidents were re- 
guested from NSC. Table 1 lists the variable measures ob- 
tained from NSC computer data banks. All data was received 
from NSC on eighty column Hollerith cards and was encoded in 
accordance with the Code Manual description. 

Ten data points from each data set of thirty possible 
points were selected for inclusion into the current analysis. 
These variables represent broad causal categories, some of 
which were involved in earlier studies mentioned in the intro- 
duction. Table 2 lists the variables selected for the cur- 
rent study. Pilot age and total flight time in the aircraft 
model involved in the reported accident have been considered 
to be measures of pilot experience. Such measures of experi- 
ence should exhibit negative correlation with accident rate 
if they are in fact true measures of experience. Age has 
been considered to be a primary measure of caution by philoso- 
pher and insurance corporations alike when the subject of 
risk has been broached. The Navy has and still is measuring 
pilot experience in terms of "Seat time" logged in applica- 
ble aircraft models. 

Pilot proficiency, a measure of recent hours flown, has 
also been directly tied to the maintenance or increase of 
flight prowess for Naval Aviators. This factor also complies 
with the age old adage of "practice makes perfect" and should 


be negatively correlated to accident rate. Measures of pilot 


ies 





TABLE l 


DATA SET REQUESTED FROM NAVAL SAFETY CENTER 


concerning the pilot: 


Age 

Injuries 

Number of previous service tours 

mecal flying time in aircraft model in which acer@ent 
occurred 

Total flight hours in previous ninety days 

Total nighttime flight hours in previous ninety days 
Potal daylight carrier landings in previous thirty 
days 

Total night carrier landings in previous thirty days 
Number of years as designated Naval Aviator 


concerning aircraft: 


Model 

Damage 

Number of tours between major aircraft rework 
Type of last major inspection 

Hours Since last inspection 

Identification of the system or component failure 


eoncerning the flight: 


Major command 

Reporting custodian 

Ship's hull number (if applicable) 

Marine Air Wing (if applicable) 

Location 

Flight Purpose Code 

Type of operation code 

Phase of operation in which the accident occurred 


concerning the accident: 


Accident identification number including calendar date 
Other aircraft damaged 

Other personnel injured 

SOncCrIDUtinG Caulsel factors 

Special data not otherwise listed 

Weather 

Accident rate for the month in which the accident 
Occurred 


Eo 





TABLE 2 


DATA SET INCLUDED IN CURRENT STUDY 


meee nccident rate by month (RATE) 

2. Pilot's age (AGE) 

3. Total flight time in accident involved aircraft model 
(TTIME) 

4. Total flight time during ninety days preceding accident 
COT 90 ) 

5. Total night flight time during preceding ninety days 
(NITE90O) 

oe Daylight carrier landings during preceding thirty days 
KeIDAY ) 

feeeetiachnt Carrier landings during preceding thirty days 
(CLNITE) 

Bee Number of alrcraft tours (ACTOUR) 

9. Aircraft flight hours since last major or minor inspec- 
meon {ACHRS) 


10. Flight Purpose Code 


proficiency selected for inclusion were total flight time in 
the last ninety days and nights, total flight time in the 
last ninety nights, the number of carrier landings in the 
last thirty days, and the number of carrier landings in the 
last thirty nights. Although flight hour data points were 
available for twenty-four and forty-eight hours prior to 
accident occurrence, these were not selected for inclusion 
to the study because of the uncertainty of whether proficien- 
cy or fatigue would be the true factor underlying each of 
these variables. 

Measures of airframe age and general condition were re- 
presented by the number of major overhauls or rework induc- 
tions which the accident aircraft had undergone (Aircraft 


Hours). Hach alrcraft in the Navy/Marine inventory is 
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required to undergo a Periodic Aircraft Rework (PAR) cycle 
for analysis and repair after a model specific number of 
flight hours have been accumulated. The number of flight 
hours accumulated by the accident aircraft since its last 
major Or minor inspection was selected as a variable measure 
of aircraft condition. This variable was also selected as a 
monitor to explain any reliability anomalies other than "“new- 
better-than-used" life expectations for critical aircraft 
components as mentioned by Butterworth, et al. (1974). 

The Flight Purpose Code was selected as a data point be- 
cause of its categorization of each flight accident in accord- 
ance with an operational mission type. Earlier studies by 
Kowalsky, et al. strongly suggested that training flights 
were a flight category rife with accident potential. Combat 
flights wherein the aircraft was not lost to enemy fire nor 
damage sustained therefrom have historically been credited 
with a less than average accident rate. Inclusion of basic 
flight purpose codes as data points was meant to clarify 


these prior suppositions as either correct or erroneous. 


eee eG IMINARY DATA PREPARATION 
Parametric statistical procedures available for determin- 


ing the relationship between variables require the assumption 


Of normality. The data must also be in the interval measure- 
ment scale. The raw data used included some measurements in 
the nominal and interval measurement scales. The technique 


a 








of averaging by month was used to transform the data into 
interval data. In addition, transformation of the data al- 
lowed the assumption of normality by invoking the Central 
Limit Theorem. 

Data points 2 through 9 of Table 2 were adjudged to be 
interval measures. Raw data for each of these variables was 
averaged by month for each of the seventy-two months within 
the total time span selected. Data point 10 was a nominal 
measure in its raw form and, therefore, required transforma- 
tion to an appropriate list of frequencies for each of the 
three major flight types to be considered. The ship's hull 
number was used as a flag in a computer program to determine 
the proportion of accidents which were attributed to carrier 
based or land based aircraft. The Flight Purpose Code con- 
Sists of a three character alphanumeric code. The second of 
the three characters specifies the basic mission type and was 
used to determine the proportion, by month, of accidents which 
occurred during training flights, general service flights, and 
combat flights. The result of the raw data transformations 
was the creation of eight new variables, six of which were 
included in subsequent parametric analysis. The new varia- 
bles included in later analysis were percent carrier training 
Mmeegnts (CVTRNG), percent carrier service flights (CVSCE), 
percent carrier combat flights (CVCBAT), percent land based 
training flights (LTRNG), percent land based service flights 


(LSVCE), and percent land based combat flights (LCBAT). 
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Variables created but not included in the analysis were per- 
cent carrier based flights and percent land based flights. 
Each percentage dealt only with those aircraft involved in a 
major accident. The new variables were adjudged to be in 


compliance with interval measurement scale requirements. 


te tHE ANALYSIS TECHNIQUE 

The stepwise multiple regression computer program package 
developed by Jae~On Kim and Frank J. Kohout at the University 
of Iowa was selected as the means of conducting the statisti- 
cal analysis of the data set. The program is included in 
Eiemotatistical Package for the Social Sciences (SPSS) com- 
piled and edited by Nie, et al. (1975). Kim and Kohout state 
that stepwise multiple regression is a recognized technique 
to: 1) “find the best linear prediction equation and evaluate 
its prediction accuracy; 2) control for other confounding 
factors in order to evaluate the contributions of a specific 
variable or set of variables; and 3) find structural rela- 
tions and provide explanations for seemingly complex multi- 
variate relationships, such as is done in path analysis." 
The primary purpose, however, is to evaluate and measure over- 
all dependence of a specific variable on a set of other vari- 
ables. The specific variable (dependent variable) used for 
the current study was monthly accident rate and the set of 
other variables (independent variables) consisted of those 


/Mested as 2 through 10 in Table 2. 
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The computer program is designed to provide the user with 
a considerable number of control options. Although the 
majority deal with computer output formats, two of the avail- 
able options can drastically affect the validity of the re- 
gression results. One such option allows the user to include 
all data sets in the computation of correlation coefficients. 
Were each data set (case) complete, the option would be harm- 
less. However, numerous cases contained in the NSC supplied 
data were incomplete. The first three years or thirty-six 
cases lacked three critical pilot proficiency oriented data 
points. Variables 5, 6 and 7 of Table 2 were void of any 
entries for July 1968 through June 1971. Were the option 
invoked, the blank data points would have been evaluated as 
zeros and included into the computation of correlation co- 
efficients. The resultant matrix of correlation coefficients 
would have been grossly biased. 

More insidious but just as damaging would have been the 
use of an option which allowed pairwise deletion of missing 
data to be selected. With this option, a missing value for 
a particular variable causes that case to be eliminated from 
Smbetlations involving that variable only. Such an option 
allows the user to realize maximum sample size if only a few 
missing values appear in his data. However, in the situation 
where numerous values are misSing, particularly if one or a 
few variables account for the bulk of the missing data, the 


sample sizes for the individual variables would not be equal 
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or nearly equal resulting in serious computational inaccurac- 
ies. 

Listwise deletion, a more conservative and accurate 
approach, provides the computer with instructions to delete 
any case from all computation if it contains missing variable 
values. The user's sample size is subject to drastic reduc- 
tion in size but computations integral to the stepwise multi- 
ple regression are insured of being accurate. This option 
was selected for all computer runs included in the current 
study. 

The stepwise multiple regression technique (Appendix C) 
1s particularly useful in studies of the current type because 
as each independent variable is entered into the regression 
equation, the percentage of the total dependent variable's 
variance yet unexplained by the independent variables already 
in the regression is calculated. The percentage of the acci- 
dent rate variability explained by the time related independ- 
ent variables chosen is exactly the type of statistical output 
required to ascertain the causal factors responsible for that 


Meera bility. 
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IV. RESULTS 


Two separate regression calculations were made. The 
first computer run included only data points for the thirty- 
six month period from July 1971 to June 1974. All variables 
except for percent carrier based flights and percent land 
based aircraft were included. The second computer run en- 
compassed the entire six year span but did not include five 
variables. The percent carrier and land based flights were 
not included nor were the three variables with excessive 
missing data for the period of July 1968 to June 1971. There- 
fore, variables 5, 6 and 7 of Table 2 were not included. 

The correlation coefficients obtained for the thirty-six 
month run are included in Table 3. Regression results indi- 
cated that the hierarchical order of variable inclusion as 
governed by the individual variable contributions towards ex- 
plaining accident rate variance was: 1) Pilot age; 2) Total 
Flight time during the previous ninety days; 3) Night carrier 
landings during the preceding thirty days; 4) Daylight car- 
rier landings during the preceding thirty days; 5) Total 
flight time in the accident involved aircraft model; 6) Total 
night flight time during the previous ninety days; 7) Per- 
cent carrier training flights; 8) Percent carrier service 
flights; and 9) Percent land based training flights. Further 
inclusion was inhibited by the program user through imposition 
of an F statistic stopping order for F values of 1.0 or less 
(Appendix D). 
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The program output provided a listing of the multiple 
correlation coefficients (multiple R), Squared coefficient 
values and simple correlation coefficients (simple R). Co- 
efficient values for the predictive regression equation were 
also provided in both standardized and non-standardized form, 
BETA and B respectively. Table 4 is a summary listing of 


computer output provided by the SPSS package. 


TABLE 4 


REGRESSION OUTPUT SUMMARY 
l. July 1978 toedune 974 datapoints 


REGRESSION 

COEBPPIC@ENT 
VARIABLES MULTIPLE R R SQUARE SIMPLE R (B) BETA 
AGE 0. 5449.2 Oms0022 0.54/92 gmO.. O45 7 -0.43384 
TOTIO 0.62462 0". SoS 0.44988 0.00489 OreZ 1o 30 
CLNITE 0267249 0.45224 ~O. 2560 203347 -0.44456 
CLDAY 0.69143 0.47807 SU ES oo )ab O00" 10 0.26089 
Gavi 0.71108 OF 2) USNs Ss) O20 G51] 0.00064 0.37329 
NITE90 Oe 72415 0.5289 2 0 0.02365 Os 5557 
CVTRNG Oe 74359 OF552 34 =05 06672 =0. 00160 a1) 5 (Orc: Share: 
CVSVCE ON Sch 8 0 eis 3785.6) 5) 0.23746 OF OrOoT 0.24413 
LTRNG 0.7 i232 0.59813 OOS. 7 / Omg 03 2a: S22 0.0)/ 
(CONSTANT) 1.10440 


An analysis of variance was conducted for the regression 
to determine the significance for each variable included. The 
null hypothesis for the tests stated that each new variable 
added to the regression did not significantly add to the vari- 
ance for accident rate explained by variables already present 
in the regression. Results of computations outlined in Appen- 
dix D indicated that inclusion of TOT90 with AGE was signifi- 


cant at the 95% confidence level. Entry of CLNITE with AGE 
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and TOT90 was significant at the 90% confidence level, and 
inclusion of CLDAY was significant only at the 75% confidence 
level. Every other variable tested failed the significance 
tests at the 75% confidence level. In equation form the 
regression became: 

om) RATE = 1.1044 - 0.04457 (AGE) + 0.00489 (TOTSO) for 

a 95% confidence level. 
eo RATE = 1.1044 - 0.04457 (AGE) + 0.00489 (TOT90) - 
0.03347 (CLNITE) for a 90% confidence level and 
3) RATE = 1.1044 - 0.04457 (AGE) + 0.00489 (TOT9O) - 
Oe03347 (CENITE) + 0.0071 (CLDAY) for a 75% .contidence 
level. 

Since TOT90 data points included both daylight and night 
flight hours, a second regression was conducted wherein cor- 
rected values for daylight hours only were used. The results 
were different. Order of inclusion into the computation did 
not change but summary output values did change. Table 5 is 
the summary listing of computer output for the regression us- 
ing DAY90 values. 

Analysis of variance testing for determination of the 
Significance effect provided by inclusion of each variable 
gave the following results: 1) CLNITE with DAY90 and AGE was 
Significant at the 90% confidence level; 2) CLDAY with 
CLNITE, DAY90O and AGE was significant at the 75% confidence 
level; 3) No other variables were significant at the 75% 


level or better. In equation form the regression becomes: 
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GABE > 


REGRESSION OUTPUT SUMMARY 
i July 1971 te Jume 1974 Data Poines 


REGRESSION 

CORRE IC EEN 
VARIABLE MURTIREE eR ResOCURRE SIMPLE R (B) BETA 
AGE 0.54792 O. 30022 —-0254792 -0.04457 -0.43384 
DAY 90 O06 29°65 O.S7572 0.41725 0.00489 025253 
CLNITE O65 520 0.42925 —0.25688 ~0708347 -0.44456 
CLDAY 0.68300 0.46648 -0.15891 0. OG gAk0 0.26089 
ir EME OS 6.9602 0.48444 Oa Gel7 0.00064 Omse 329 
NITE9O Ona l5 0.52439 O22 707/ 0.02555 0.42915 
CVTRNG OFT 4s19 Dood y= OR0GG? 2 =O 001s0 =r Oia 
GyoVCE Oia 5 2 0 odes 0.23746 OOo 1 0.24413 
LTRNG Oy 7339 OS 59813 .- 0205787 0 00397 OeZ2067 
(CONSTANT ) 1.10440 


1) RATE = 1.1044 - 0.04457 (AGE) + 0.00489 (DAY90) - 
0.03347 (CLNITE) at the 90% confidence level and 
2) RATE = 1.1044 - 0.04457 


(AGE) + 0.00489 (DAY90) - 


Or05347 (CLNITE) + 0.0071 {(CEBAY) at the 75% conmtid— 
ence level. 

The regression completed for the entire seventy-two month 
period of July 1968 to June 1974 wherein variables with large 
numbers of missing values were removed, served to support the 
Order of inclusion for AGE and TOT90 found earlier. Because 
CLDAY, CLNITE, and NITE90 data points were not included in 
the input to the program, the regression selected LCBAT as 
the third variable for inclusion. No other variables were 
included. Analysis of variance showed TOT90 with AGE to be 
Significant at the 95% confidence level. LCBAT inclusion 


waS Significant only at the 75% confidence level. 
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The residuals for this study are defined as the devia- 
lion of observed accident rate from the estimate accident 
rate obtained from the appropriate regression equation. The 
residuals are used as a basis for computation of the multiple 
correlation coefficients. Direct examination of the residuals 
also provides information relevant to the linearity and nor- 
mality assumptions necessitated by the multiple linear re- 
gression technique. Regression analysis requires assumptions 
of error component independence, component mean of zero, and 
the same component variance throughout the range of dependent 
variable values. The SPSS package provides a visual plot of 
residuals against the predicted values of the dependent vari- 
able determined by the regression equation. Examination of 
the plot failed to reveal the presence of any abnormalities 
indicative of faulty assumptions for each of the computer 


runs made. 
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V. DISCUSSION 


For each of the regression studies made, the pilot's age 
was most instrumental in explaining accident rate variance. 
AGE explained 30.022% of the variance for each of the thirty- 
Six month studies. It accounted for 31.217% of the variance 
for the seventy-two month study. The negative simple corre- 
lation coefficients indicates that as the accident rate in- 
creased, a relatively young age group was involved. Prior 
studies conducted using age aS a variable measure have equated 
age to pilot experience. The authors of this tien agree that 
age 1S a meaSure of experience but not necessarily of pilot 
experience which connotes growing old at the cockpit ccntrols. 
Older people tend to be more rational and prefer risk aver- 
sion to risk taking. The older a person becomes, the less 
impulsive he tends to be. Age also tends to provide a person 
with a larger repertoire of near tragedies from which to draw 
reminders or analogies for current situational positions. 

Daylight flight time accumulated for the ninety day period 
preceding accident involvement, DAY90, accounted for 7.550% of 
the accident rate variance. Prior studies have equated this 
variable to a measure of pilot proficiency, as indeed it logi- 
cally should be. However, the positive simple correlation 
coefficient associated with DAY90 would seem to refute such 
an interpretation. It is unlikely that the more proficient a 
pilot becomes, the more prone he is to accident involvement. 


The ninety day period is too long to attribute large numbers 
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of hours flown to fatigue inducement at the time of the acci- 
dent. Note, however, that the actual distribution of flight 
hours within the ninety day period is not known and, there- 
fore, precludes the authors from discounting the possibility 
of a fatigue factor with any degree of certainty. A more 
likely explanation would tend to follow Goorney's supposition 
that pilot complacency may increase directly as the number of 
meurs Llown and thus contribute to accident occurrence. No 
definitive explanation exists at this date for the positive 
correlation phenomena noted. 

Both CLNITE and CLDAY simple correlation coefficients are 
negative and would seem to support variable categorization as 
measures of pilot proficiency. CLNITE accounts for 5.356% of 
the accident rate variance while CLDAY is responsible for 
feeeoa. Both variables represent the portion of a flight pro- 
file wherein pilot performance is critical. Of particular 
mieerest is the fact that the variable measure representing 
an almost purely instrument flight situation (CLNITE) is more 
critical in the explanation of accident rate variability than 
is the variable measure often associated with a more VFR 
associated flight situation. Again, risk aversion is more 
Pecely to occur under night or instrument flight conditions, 
whereas day or visual flight conditions seem to foster risk 
making. 

Regardless of whether the variable measures included in 


the regression are identified as belonging to pilot 
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Beererency Or pilot experience categories, the fact that all 
Smee were pilot oriented should impress the reader. Pilot 
error of one sort or another is listed as the single largest 
cause of aircraft accidents. Results of this study would tend 
to corroborate the listed causSe as correct. Analysis indi- 
cates that 46.648% of the total accident rate variability is 
explained by pilot related variable measures, if we are will- 
ing to accept statistical confidence levels of 75% as meaning- 


fui. 
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VI. RECOMMENDATIONS 


The current study includes all aircraft communities in 
its treatment of accident rates. To be so general is to miss 
an opportunity to isolate the true causal factors behind the 
accident rate fluctuations. The all Navy/Marine accident 
rate variability 1S composed of many individual components 
generated by different aircraft types. Each aircraft type 
should be analyzed separately in an attempt to ascertain 
commonalities between aircraft communities. Those communities 
which seem to contribute little or no inputs should be re- 
moved from the flight hour data necessary to compute accident 
rates. Aircraft types or communities which contribute heavily 
towards the overall accident rate would most likely not show 
rate fluctuations, each of which were in concert with the 
Others. They would probably act in concert at times and at 
other times be diametrically opposed. 

Accidents involving carrier based aircraft should be 
separated from those involving land based aircraft. Variable 
measures which deal with each community should be relegated 
to correlation studies with the appropriate carrier or land 
based accident rate. To make analytical studies relating 
@aeriler training flights with the accident rates derived from 
all Navy/Marine data may bury some possible correlative results 
which could be most informative. To demonstrate the degree 
in which general comparisons can result in loss of valuable 


information, review the regression outputs contained in Table 
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O@eeand Table 5. Removal of night flight hours from the TOT90 
variable measure prompted a sizable decrease in the associated 
multiple correlation coefficient. 

The authors of this study had as their intended goal, the 
analysis of accident rate by type aircraft. Flight hour data 
necessary to compute accident rates by aircraft type, carrier 
versus land communities, or by major commands was not avail- 
able for use within the time constraints imposed. Follow-on 
studies should be encouraged to pursue a more microscopic 
approach to the problem in these areas. ReSearch involving 
variables categorized as measures of pilot error should hold 
the greatest rewards if the results of this initial study are 
used aS a guide. Fiscal policy data might also provide de- 
finitive answers, particularly in the cases where availability 
of funds is a key determinant in the tempo of operations. The 
relationship between the tempo of operations and the accident 
rate has been the subject of a study conducted by Robino 
(1972) for the Naval Safety Center. The study was somewhat 
broad in scope, but does tend to encourage a more detailed 


study into this area. 
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APPENDIX A 


AVERAGE MONTHLY DATA POINT VALUES 
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APPENDIX B 
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APPENDIX C 
STERWESEe MULTIPLE SREGRESS ION 


Basic multiple regression is an analytical technique where- 
by a linear approximating equation is sought for a dependent 
variable in terms of two or more independent variables. The 
general mathematical model for multiple regression in its un- 


standardized form is 


Y' =A + b,x) tr b,x, “ee ae bX 


where Y' is the regression estimate value of the dependent 
variable, A is the Y intercept constant, b. represents the 
regression coefficients, and X. are the independent variables. 

The objective sought in the conduct of multiple regres- 
sion is to find the best linear predictive equation possible 
by insuring that the sum of squared residuals, © (Y - Y') , 
8: euaemazed:.. By. BEE Ne le sehe squared | re ates 05 
the regression oRaRR TERS a me to maximize the correlation 
between Y, the observed dependent variable value, and Y', the 
regression estimate. 

The goodness of fit of the regression equation can be 
characterized by the proportion of variance explained for Y. 
To do so, the square of the multiple correlation, ae is used. 


R? iSeCadeul ated vbye 
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where SSy is the total variation or sum of Squares in Y, 
SS is the sum of squared residuals, and SS is the re- 

res reg 
gression of squares. The numerator for the right hand por- 
tion of the equation represents the variation in Y explained 
by the combined linear influence of the independent variables. 
The denominator is a measure of the total variation in Y. 

The determination of the regression equation stems from 


the simultaneous solution of the standardized normal equa- 


tions 


By oF Bor i2 4+ Bor 3 slamecueiccs tate Bak = rvy 
Pe ee 54 eer te. > Py 
oo ee ee = 


where B. are the standardized regression coefficients of the 


independent variable Xe and “as are the Pearson or product 


A tt % 


SY motient.-corrélations between’ variable® xy %and’ Kris ee 


Stepwise inclusion of independent variables available in 
the Statistical Package for the Social Sciences (SPSS version 
6) used for this study allows entry of each variable, one at 
a time. Inclusion criteria are such that the order of varia- 
ble entry is dictated by accountability of each for previous- 
ly unexplained variance. Thus, the variable that explains the 
greatest amount of variance previously unexplained by the 


variables already in the equation is the next to be entered. 
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APPENDIX D 
Selo CAL Te oot ORS LGONITETCANCE 


Procedural statistical testing for goodness of fit for the 
regression equation is accomplished by conducting Analysis of 


Variance (ANOVA). The null hypothesis, H used for the test- 


0! 
ing is that the next variable to be added in a stepwise man- 
meeewould not add significantly to the explained variance in 
the dependent variable, Y, already accounted for by variables 
included in the regression equation. The alternative hypo- 
thesis, His directly contradicts the null hypothesis. An 
equivalent statement of Ho would be that all "k" of the re- 
gression coefficients would be identically equal to zero as 
Opposed to the Hy statement of at least one coefficient not 


equating to zero. 


The first step of stepwise regression with forward elimina- 


ally regressed against the dependent variable. F statistics 
are computed for each dependent-independent variable pair. 

The pair having the highest F statistic computed is permanent- 
ly selected for inclusion into the regression equation. The 

F statistic used in the first step is 


r2 
Yl, 1 
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+h! eden “is for each ‘6£ “the “indépendentvar lables “to” be “individus °°" 





where ry) Hothe Simipre correlation, Ry.1,2,...,k is the 
multiple ‘correlation for "k" independent variable inclusion, 
and "k" is the number of independent variables entered. N 
is the sample size used in the particular regression under 
consideration. 

For each successive step in determining the regression 
equation, variables in the equation are retained and independ- 
ent variables not in the equation are temporarily entered for 
F statistic computation. At each step, the combination of 


variables resulting in the highest F statistic is kept. The 


general form of the F statistic used for each successive step 


eS 
(incremental SS due to X, ) r2 
-_ ee Vile 1, 2) kone al 
— 7 ee = rr a 
SS / (N-k-1) (1-R, ) 
res bere + oak / (eee 


where SS is the sum of the squares, Xp 1s the a independent 


* wattable to’-be.‘added,-5S.:/ is-the. sum.of squares~of..the ree tyo% ys, 


Siduals, and ate ae 1S the partial correlation be- 


tween Y and xy when Xe, i = 1,2,...,k-l are held fixed. 

After all independent variables which contribute to the 
explained variance have been added to the regression equation 
in the order dictated by the amount of variance accounted for, 
the significance of each added variable can be determined 


through use of a separate F statistic. The general form for 


mie Statistic is 


eye 





2 


2 
Bie Pt a k-1)!/ 


5 
[1-Ry 1,2,...,k)!/7 (N-k-1) 


where "k" is the number of the variable to be tested for in- 
clusion and M is the degree of freedom commensurate with the 


Variable(s) tested. 
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